coordinated multi-agent imitation
19eca5979ccbb752778e6c5f090dc9b6-AuthorFeedback.pdf
Reviewer 1 Q: Novelty against [3]: The differences: (1) They do reinforcement learning, while we do imitation1 learning,whichismuchharder. Forexample, whytoassumeitisonlynecessary tomodel interactions within onetypeof6 agent (lines 151-152): Thank you for the suggestion. We will include them in the discussion. It shows that the agent learns to put different38 attention at different time step. In the morning (8 am) it puts more39 attention on 1-2 types of vehicles.